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Abstract 

An important first step when deploying a wireless ad hoc network is neighbor discovery in 
which every node attempts to determine the set of nodes it can communicate within one wireless 
hop. In the recent years, cognitive radio (CR) technology has gained attention as an attractive 
approach to alleviate spectrum congestion. A cognitive radio transceiver can operate over a 
wide range of frequencies possibly spanning multiple frequency bands. A cognitive radio node 
can opportunistically utilize unused wireless spectrum without interference from other wireless 
devices in its vicinity. Due to spatial variations in frequency usage and hardware variations in 
radio transceivers, different nodes in the network may perceive different subsets of frequencies 
available to them for communication. This heterogeneity in the available channel sets across 
the network increases the complexity of solving the neighbor discovery problem in a cognitive 
radio network. In this work, we design and analyze several randomized algorithms for neighbor 
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discovery in such a (heterogeneous) network under a variety of assumptions ( e.g ., maximum 
node degree known or unknown) for both synchronous and asynchronous systems under mini¬ 
mal knowledge. We also show that our randomized algorithms are naturally suited to tolerate 
unreliable channels and adversarial attacks. 

Key words: multi-hop multi-channel wireless network, cognitive radio technology, hetero¬ 
geneous channel availability, neighbor discovery, randomized algorithm, asynchronous system, 
clock drift, lossy channel, jamming attack 

1 Introduction 

Neighbor discovery is an important step in forming a self-organizing wireless ad hoc network without 
any support from an existing communication infrastructure [8ll9lll91l20ll291l42ll51ll541l571l581163j . When 
deployed, nodes initially have no prior knowledge of other nodes that they can communicate with 
directly. 

The results of neighbor discovery can be used to solve other important communication problems 
such as medium access control 0 [221, routing m, broadcasting [50], (M], clustering j I (Tj 32) ITT . 
collision-free scheduling 20) ]27] , spanning tree construction [21], and topology control [57, 62]. 
Many algorithms for solving these problems implicitly assume that all nodes know their one-hop 
and sometimes even two-hop neighbors. Many location-based routing protocols (e.g. localized 
routing in Vehicular Ad hoc Networks (VANETs)) use the position of neighboring nodes to make 
routing/forwarding decisions. The neighborhood information is also used to update the reachability 
status of nodes. A better neighbor discovery algorithm, which uses fewer messages and has higher 
accuracy, can be used to improve the performance of location-based routing algorithms mm- 
Neighborhood information helps reduce the cost of flooding in multicast tree construction using 
flooding algorithms [39]. Mobile sensing applications, ranging from mobile social networking to 
proximity-based gaming, involve collection and sharing of data among nearby users. The success 
of these applications depends on neighborhood information [22] • Neighbor discovery is extremely 
important in Underwater Acoustic (UWA) Networks and needs to be done frequently because nodes 
may move proactively due to the unpredictable underwater currents [68] , More details of how the 
results of neighbor discovery can be used to solve other communication problems can be found 
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Cognitive radio (CR) technology has emerged as a promising approach for improving spectrum 
utilization efficiency and meeting the increased demand for wireless communications [13]. A CR 
node can scan a part of the wireless spectrum, and identify unused or underutilized channels in 
the spectrum mm- CR nodes in a network can then use these channels opportunistically for 
communication among themselves even if the channels belong to licensed users. The licensed users 
are referred to as the primary users , and CR nodes are referred to as the secondary users. (Of 
course, when a primary user arrives and starts using its channel, the secondary users have to vacate 
the channel.) Due to spatial variations in frequency usage, hardware variations in radio transceivers 
and uneven propagation of wireless signals, different nodes in the network may perceive different 
subsets of frequencies available to them for communication. This gives rise to a multi-hop, multi¬ 
channel, heterogeneous wireless network, abbreviated as JVpHeW network. The focus of this work 
is on solving the neighbor discovery problem in an M 2 HeW network. 

A large number of neighbor discovery algorithms have been proposed in the literature. Most 
of the algorithms suffer from one or more of the following limitations: (i) all nodes are assumed 
to be synchronized (synchronous system), (ii) the entire network is assumed to operate on a single 
channel (single-channel network), (iii) all nodes are assumed to be able to communicate with each 
other (single-hop network), (iv) all channels are assumed to be available to all nodes (homoge¬ 
neous network), or (v) the algorithm is only evaluated experimentally (no theoretical guarantees 
provided). A more detailed discussion of the related work can be found in Section [71 

Our Contributions: Our main contribution in this work is two randomized neighbor discovery 
algorithms for an AftHeW network when the system is asynchronous that guarantee success with 
arbitrarily high probability. The first algorithm assumes that nodes know a good upper bound 
on the maximum degree of any node in the network. The second algorithm does not make any 
such assumption. Both algorithms only assume that the maximum drift rate of the clock of any 
node is bounded, with the second algorithm assuming a tighter but unknown bound whose value 
depends on various system parameters. None of the algorithms require clocks of different nodes 
to be synchronized. In fact, clocks of any two nodes may have arbitrary skew with respect to 
each other. Other advantages of our algorithms are as follows: (i) nodes do not need to agree 


3 


on a universal channel set, and (ii) the running time of an algorithm depends on the “degree of 
heterogeneity” in the network; the running time decreases as the available channel sets become 
more homogeneous. 

Our algorithms for an asynchronous system are based on those for a synchronous system. 
Therefore, as additional contributions, we also present a suite of randomized neighbor discovery 
algorithms for an ApHeW network when the system is synchronous under a variety of assumptions 
such as: (i) whether all the nodes start executing the neighbor discovery algorithm at the same 
time or not, and (ii) whether nodes are aware of an estimate on the upper bound on the maximum 
degree of any node in the network or not. 

We believe that our approach for transforming a state-less algorithm developed for a syn¬ 
chronous M 2 HeW network to work for an asynchronous M 2 HeW network can also be applied to 
other important communication problems in an M 2 HeW network with running time increasing by 
only a constant factor. 

We show that our randomized algorithms can easily tolerate unreliable channels. We also prove 
that, our algorithms, with minor modification in the asynchronous case, are tolerant to jamming 
attacks by a reactive but “memory-less” jammer under certain assumption. The running time of 
our algorithms, when subject to a jamming attack, increases by at most a constant factor. In 
fact, for sufficiently large values of system parameters (namely, number of nodes and number of 
channels), the running time increases only by a factor of at most two in the worst-case. 

Organization: The rest of the manuscript is organized as follows. We describe our system model 
for a multi-hop multi-channel heterogeneous wireless network in Section [2j For ease of exposition, 
we first present a suite of randomized neighbor discovery algorithms for a synchronous system 
under a variety of assumptions and analyze their time complexity in Section [3j We then present two 
randomized neighbor discovery algorithms for an asynchronous system, which are derived from their 
synchronous counterparts, and analyze their complexity in Section |4j We discuss several extensions 
to our algorithms to enhance their applicability and improve their robustness in Section [6j Finally, 
in Section [71 we give a comparison of our contributions with existing research and also examine 
other related work done on neighbor discovery. 
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2 System Model 


We assume a multi-hop multi-channel heterogeneous wireless (M 2 HeW) network consisting of one 
or more radio nodes. Let N denote the total number of radio nodes. Nodes do not know N. Each 
radio node is equipped with a transceiver (transmitter-receiver pair), which is capable of operating 
over multiple frequencies or channels. However, at any given time, a transceiver can operate (either 
transmit or receive) over a single channel only. Further, a transceiver cannot transmit and receive 
at the same time. Transceivers of different nodes need not be identical; the set of channels over 
which a transceiver can operate may be different for different transceivers. 

Different nodes in a network may have different sets of channels available for communication. 
For example, in a cognitive radio network (a type of M 2 HeW network), each node can scan the 
frequency spectrum and identify the subset of unused or under-used portions of the spectrum even 
those that have been licensed to other users or organizations m- a node can potentially use such 
frequencies to communicate with its neighbors until they are reclaimed by their licensed (primary) 
users p)3]. Due to spatial variations in frequency usage/interference and hardware variations in 
radio transceivers, different nodes in the network may perceive different subsets of frequencies 
available to them for communication. We refer to the subset of frequencies or channels that a node 
can use to communicate with its neighbors as the available channel set of the node. For a node u, 
we use A(u) to denote its available channel set. We use S to denote the size of the largest available 
channel set, that is, S = max |*4.(it)|. Note that nodes do not know S. 

U 

We say that a node v is a neighbor of node u on a channel c if u can reliably receive any message 
transmitted by v on c provided no other node in the network is transmitting on c at the same time, 
and vice versa. We assume that the communication graph is symmetric because unidirectional 
neighborhood relationships are expensive and impractical to use in wireless networks |i?] (although 
our algorithms work for asymmetric communication graphs as well). For a node u and a channel 
c G A(u), we use A(it, c) to denote the number of neighbors, also known as degree , of u on c. We 
use A denote the maximum degree of any node on any channel, that is, A = max max A(it, c ). 

u ce.4(u) 

Note that, if nodes u and v are neighbors of each other on some channel, then u has to discover 
v and v has to discover u separately. It is convenient to assume two separate links—one from u to 
v and another from v to u. We use (it, v) to denote the link from u to v. We refer to the set of 
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channels on which the link (u, v ) can operate as the span of (u, v ) and denote it by span(u, v). Note 
that span(u,v ) C A(u) nM(u). We refer to the ratio of \span{u,v)\ to |M(u)| as the span-ratio of 
the link ( u , v). Note that the span-ratio of any link lies between ^ and 1. Further, the span-ratio of 
(u, v) may be different from the span-ratio of (v,u) (because |*4.('u)| may be different from |M(u)|). 
We use p to denote the minimum span-ratio among all links. Note that nodes do not know p. 
Intuitively, p can be viewed as a measure of the degree of homogeneity in the network—the larger 
the value, the more homogeneous the network is in terms of channel availability for nodes and links. 
The running time of our algorithms is inversely proportional to p. When all links can operate on 
all available channels (an assumption made frequently in the literature), p = 1 , which minimizes 
the running time of our algorithms. In general, the more heterogeneous the network is, the larger 
is the running time of our algorithms. 

If nodes v and w are neighbors of node u on channel c and both v and w transmit on c at the 
same time, then their transmissions collide at u. If u is listening on c when v and w transmitting 
on c at the same time, then u hears only noise. We do not assume that nodes can detect collisions, 
that is, distinguish between background noise and collision noise. 

The set of channels over which an M 2 HeW network can operate may have very different propa¬ 
gation characteristics. For example, a cognitive radio transceiver may be capable of operating over 
a wide range of frequencies ( e.g ., as low as 100 MHz to as high as 2.5 GHz HD)- Radio waves 
at different frequencies undergo different angles of refraction and therefore propagate in different 
ways. As a result, two radio nodes u and v that are neighbors on some channel c may not be neigh¬ 
bors on another channel d even if both u and v have d in their available channel sets. This may 
happen, for example, if c is a channel in the 900 MHz band and d is a channel in the 2.4 GHz band; 
a lower frequency has stronger capability to penetrate through different materials and therefore 
has a longer transmission range than a higher frequency. For ease of exposition, we first describe 
our algorithms assuming that all frequencies or channels have identical propagation characteristics. 
This implies that if a communication link from node u to node v can operate over some channel 
c € A(u) FI A{v) then it can also operate over all channels in A{u) H A(v). We later discuss how to 
extend our algorithms to handle diverse propagation characteristics of different frequencies. 

We use “log” to refer to the logarithm to the base 2 and “In” to refer to the natural logarithm. 
In this paper, we investigate the neighbor discovery problem both when the system is synchronous 


6 


and when the system is asynchronous. We now describe how we model each type of system. 

Synchronous System: We assume that the execution of the system is divided into synchronized 
time-slots. In each time slot, each node can be in one of the following three modes: (i) transmit 
mode on some channel in its available channel set, (ii) receive mode on some channel in its available 
channel set, or (iii) quiet mode when the transceiver is shut-off. Nodes may or may not start at the 
same time. 

Asynchronous System: We assume that every node is equipped with a clock. Clocks of different 
nodes are not required to be synchronized. For a clock C and time f, we use C(t) to denote the 
value of C at t. For a node u , we use C u to denote the clock of u. The clock of a node may have 
non-zero drift and the drift rate may change over time. The drift rate of a clock C at time t is 
given by ^ — 1. If C is an ideal clock, then, Vi, At > 0, C(t + At) — C(t ) = At; thus ^ = 1. 
At any given time, different clocks may have different drift rates. Further, the drift rate of the 
same clock may change over time both in magnitude and sign. We do, however, assume that the 
magnitude of the maximum drift rate is bounded and let the bound be denoted by 5. This implies 
that Vi, At > 0, 

(1 — 5) At < C(t + At) — (7(f) < (l + <5)Af (1) 

For an ideal clock, 5 = 0. In practice, 5 is quite small ( e.g ., 10~ 6 seconds/second for an ordinary 
quartz clock). 

3 Neighbor Discovery in Synchronous Systems 

We first describe our algorithms for neighbor discovery assuming that all nodes start executing 
the algorithm at the same time. We then relax this assumption and describe our algorithms for 
neighbor discovery when nodes may start executing the algorithm at different times. Later, in 
Section [U we use the algorithms for variable start times to derive randomized neighbor discovery 
algorithms for an asynchronous system. 
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// Algorithm for node u 

Input: An upper bound on maximum node degree, say A est . 

Output: The set of neighbors along with the subset of channels that are common with the neighbor. 


1 

2 


3 


4 

5 

6 

7 

8 
9 

10 

11 

12 

13 


while true do 

// execute a stage 

for i <T- 1 to l’log(Aest)! do 

// time-slot i of the current stage 
// select a channel 

c <— channel selected uniformly at random from A(u)\ 

// compute the transmission probability 

tune the transceiver to c; 

switch to transmit mode with probability p and receive mode with probability 1 — p; 
if (in transmit mode) then 
| transmit a message containing A(u); 
else if (heard a clear transmission) then 

let the received message be sent by node v containing set A\ 
add (v, A n A(m)) to the set of neighbors; 
end if 
end for 


14 end while 


Algorithm 1: Neighbor discovery algorithm for a synchronous system with identical start times and 
knowledge of upper bound on maximum node degree. 


3.1 Identical Start Times 

We first assume that the nodes know some upper bound on maximum node degree and present a 
neighbor discovery algorithm. The bound need not be tight and, in fact, may be quite loose (since 
the dependence is logarithmic on the value of the upper bound.) But all nodes should agree on a 
common upper bound. We then relax this restriction and describe an algorithm for the case when 
such knowledge is not available. 


3.1.1 Knowledge of Loose Upper Bound on Maximum Node Degree 

Let A es t denote an upper bound on the maximum node degree as known to all nodes. The execution 
of the algorithm is divided into stages. Each stage consists of \ log(A est )] time-slots. In each time- 
slot of a stage, a node randomly chooses a channel from its available channel set and transmits on 
that channel with a certain probability (and listens on that channel with the remaining probability). 
Specifically, in time-slot i of a stage, where 1 < i < [ log(A es t)], a node u transmits on the selected 
channel, say c, with probability min () and listens on c with the remaining probability. 

A formal description of the algorithm is shown in Algorithm [D We next analyze its running 
time. Consider a node u and let node v be its neighbor on a channel c. Observe that 1 < A (it, c) < 








A < A es t. Let k = max(l, [ log A (it, c)]). Clearly, we have: 


2 k ~ l < A (it, c) < 2 k 


( 2 ) 


We say that a time-slot t covers the link (v, u ) on channel c if during t: (i) v transmits on c, 
(ii) u listens on c, and (iii) no other neighbor of u transmits on c. The three conditions collectively 
ensure that u receives a clear message from v (on c during t). We say that a stage s covers the link 
(u, it) on channel c if some time-slot of s covers (v, u) on c; also, s covers (v, u ) if s covers (v, u ) on 
some c € span(v,u). Finally, we say that a sequence of stages covers the link (v,u) if some stage 
of the sequence covers (v,u). 

Consider a stage s and let r denote the time-slot of s that satisfies ©• We first compute 
the probability that (v,u) is covered on c during r. Let A(r,c), B(t,c), C(t,c) denote the three 
events corresponding to the three conditions dU) , (JHl) and dm]) , respectively, required for coverage. 
Note that the three events are mutually independent. Therefore we first compute the probability 
of occurrence of events B(t,c ) and C(t,c ) separately. We then compute the probability 

that s covers (v,u). The steps involved in computing the probabilities are fairly standard and have 
been moved to the appendix. We only state the results here. We have: 


Pr{j4(r, c)} > 
Pr{i?(r, c)} > 
Pr{C(r,c)}> 
Pr(s covers (v,u)) > 


1 


2 max(5', A) 
1 


2 |^(«)| 

1 

4 

P 


16max(5, A) 


(3) 

(4) 

(5) 

( 6 ) 


Now, consider a sequence of M = 16ma ^ g ’ A ' > In stages, where e is the probability that 

neighbor discovery does not complete successfully. We show that the probability that the link 
(v,u) is not covered within M stages is at most jfe. The steps for computing the probability are 
fairly standard and have been moved to the appendix. Formally, 


Pr((u, u) is not covered within M stages) < 


N 2 


(7) 
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Finally, we have: 


Pr(neighbor discovery does not finish within M stages) 

= Pr(some link is not covered within M stages) 

< (number of links in the network) X —k < e 

N z 

Therefore, we have the following theorem: 

Theorem 1. Algorithm [I] guarantees that each node discovers all its neighbors on all channels 
within O ^ max(5, A) l 0 g(A est ) log (y)) time-slots with probability at least 1 — e. 

3.1.2 No Knowledge of Maximum Node Degree 

One way to derive a neighbor discovery algorithm when knowledge about maximum node degree 
is not available is as follows. Starting with an estimate of one for the maximum node degree, 
repeatedly run an instance of the algorithm that assumes knowledge about the maximum degree for 
a certain number of time-slots with geometrically increasing values for the estimate. This approach, 
for example, is used by Vasudevan et al. in |58| to eliminate the assumption about network size 
knowledge made by some of their neighbor discovery algorithms. The approach has the advantage 
that the time complexity of the knowledge-unaware algorithm is asymptotically same as that of the 
knowledge-aware algorithm. This approach cannot be used here because it requires computing the 
exact number of time-slots for which an instance of the knowledge-aware algorithm ought to be run 
such that, in case the estimate is correct, the neighbor discovery completes with a desired success 
probability. Computing the number of time-slots for our (knowledge-aware) algorithm requires 
nodes to a priori know the values for other system parameters, namely N, S and p, whose values 
may not be known in advance. 

We instead employ the following approach used in [46]. Starting with an estimate of two, we 
repeatedly execute an instance of stage with sequentially increasing values for the estimate. 

A formal description of the algorithm is shown in Algorithm [2] It can be easily verified that, 
once d becomes at least A, each stage thereafter contains a time-slot that satisfies ([2]). To reach the 
success probability of at least 1 — e, from the analysis of Algorithm [lj it suffices for an execution to 
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// Algorithm for node u 

Output: The set of neighbors along with their available channel set 

1 d <- 2; 

2 while true do 

3 execute an instance of stage described in Algorithm |T| with A es t set to d; 

4 d t— d + 1; 

5 end while 

Algorithm 2: Neighbor discovery algorithm for a synchronous system with identical start times and 
no knowledge of maximum node degree. 

contain M = 16 max(5,A) ^ stages each of which consists of a time-slot that satisfies Q- In 

other words, neighbor discovery completes successfully with probability at least 1 —e within A + M 
stages. Note that M = fl(A). Therefore we have: 

Theorem 2. Algorithm [H guarantees that each node discovers all its neighbors on all channels 
within 0(M log M) time-slots with probability at least 1 — e, where M = 16 max(s,A) j n . 

3.2 Variable Start Times 

We first assume that the nodes know some upper bound on maximum node degree and present a 
neighbor discovery algorithm. We then relax this restriction and describe an algorithm for the case 
when such a knowledge is not available. Let T s denote the time by which all nodes have initiated 
neighbor discovery. 

3.2.1 Knowledge of Good Upper Bound on Maximum Node Degree 

We assume that nodes know a “good” upper bound on the maximum node degree. Although the 
algorithm works even if the upper bound is loose, the running time of the algorithm may be too 
large since it depends linearly on the value of the upper bound. 

The main idea behind our algorithm is to ensure that the transmission probability of a node 
is the same for every time-slot (but may be different for different nodes). This allows us to prove 
that a given link is covered in a time slot with “sufficiently high” probability. Let A es t denote 
an upper bound on the maximum node degree as known to all nodes. In each time-slot, a node 
u randomly selects a channel from its available channel set, say c. It then transmits on c with 
probability min ^ and listens on c with the remaining probability. A formal description of 

the algorithm is shown in Algorithm [3l 
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// Algorithm for node u 

Input: An upper bound on maximum node degree, say A est . 

Output: The set of neighbors along with the subset of channels that are common with the neighbor. 


// compute the transmission probability; 


same for all time-slots 


1 a;,, 

2 while true do 

// select a channel 

3 c<— channel selected uniformly at random from A(u); 

4 tune the transceiver to c; 

5 switch to transmit mode with probability p and receive mode with probability 1 — p; 

6 if (in transmit mode) then 

r | transmit a message containing A(u)\ 

8 else if (heard a clear transmission) then 

9 let the received message be sent by node v containing set A; 

10 add (v, A n A(u)) to the set of neighbors; 

l end if 

i end while 


Algorithm 3: Neighbor discovery algorithm for a synchronous system with variable start times and 
knowledge of upper bound on maximum node degree. 


For the analysis of the running time, as in the case of Algorithm |T] we can compute the 
probability of occurrence of events A(t,c), B(t,c ) and C(r,c). We have: 


Pr{A(r,c)} = 


1^)1 


x mm 


1 


1 \A(v)\ 

2’ A est 


> 


max{ 2\A(v)\,A est } 
1 


max(2S, A est ) 


( 8 ) 


It can be verified that the inequalities for Pr{B(r, c)} in ([4j) and Pr{C(r, c)} in © are still 
valid (although the proofs have to be slightly modified). Using an analysis similar to that for 
Algorithm [I] we can prove the following result: 

Theorem 3. Algorithm 0 guarantees that each node discovers all its neighbors on all channels 
within O | max t 2 ^ A|;st ) J 0 g (A)^J time-slots of T s with probability at least 1 — e. 

Note that we no longer have a factor of 0(log(A es *)) in the time complexity because we do not 
have stages any more. 
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// Algorithm for node u 

Output: The set of neighbors along with the subset of channels that are common with the neighbor. 

1 it- 1 ; 

2 while true do 

// epoch i 

3 for j <— 1 to i + 1 do 

// phase j in epoch i 

4 execute an instance of Algorithm [3] with A e3 t. = 2? for 2* time-slots; 

5 end for 

6 i <— i+ 1; 

r end while 


Algorithm 4: Neighbor discovery algorithm for a synchronous system with variable start times (no 
knowledge assumed). 

3.2.2 No Knowledge of Maximum Node Degree 

We refer to the difference in the start times of two nodes as slack, and use 0 to denote the maximum 
slack between any two nodes (expressed in terms of the number of time-slots). Also, note that no 
node knows 0. In fact, we present a neighbor discovery algorithm in which no node knows any of 
the other system parameters, namely, S, A, p, N and e. And, yet the algorithm completes neighbor 
discovery within a “short” period with high probability. 

The execution of the algorithm is divided into epochs. Each epoch is further divided into phases. 
Each phase consists of a certain number of time-slots. Specifically, epoch i, for i = 1,2,3,..., 
consists of i T 1 phases. In phase j of epoch i, where 1 < j < i + 1, we execute an instance 
of Algorithm [3] with A es t set to for a fixed number of time-slots given by 2*. Note that the 
number of phases in an epoch increases linearily but the length of a phase in an epoch increases 
geometrically with the epoch number. Also, note that all phases in an epoch are of the same length 
but use geometrically increasing values for estimate of the upper bound on maximum node degree. 
A formal description of the algorithm is shown in Algorithm |H 

In order to show that Algorithm |4] guarantees that neighbor discovery completes with high 
probability within a certain number of time-slots, it is sufficient to identify a phase in an epoch 
such that: (i) nodes use a sufficiently large value as an estimate for maximum node degree during 
this phase, and (ii) this phase for all nodes overlaps for a sufficiently long time so as to allow all 
links to be discovered. 

Let Aq denote the smallest power of two greater than or equal to 6. Further, let M = 
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i6max(s,Ao) j n ^ A_ j . Given a node u, we refer to the phase numbered log Ao of the epoch numbered 
[ log(M + 0)] as the potent phase of node u. (We call it potent because we show that neighbor 
discovery is guaranteed to complete with high probability in this phase.) In its potent phase, each 
node executes an instance of Algorithm [3] with A es t set to Ao for at least M + 0 time-slots. Let 
T& denote the latest time at which some node begins executing its potent phase. Likewise, let T e 
denote the earliest time at which some node ends executing its potent phase. Note that the time 
difference between when two nodes start the same phase is same as the slack between the two nodes 
(the time difference between when two nodes start the neighbor discovery algorithm). Since the 
maximum slack between any two nodes is 0, clearly, T e — T& > M. This implies that, during the 
time period from Tj, to T e , the following holds: (i) the potent phases of all nodes mutually overlap 
in time, and (ii) the time-period contains at least M time-slots from the potent phase of every node. 
We refer to the period of time from T& to T e as the potent period. During the potent period, each 
node uses the same estimate for the upper bound on maximum node degree, namely Ao- Using an 
analysis similar to that for the previous algorithm, it can be shown that a single time-slot during 
the potent period covers a given link with probability at least 16 a 0 ) • This in turn implies 

that the probability that neighbor discovery completes successfully during the potent period is at 
least 1 — e. 

Note that epoch i contains (i + 1) • 2* time-slots. Thus, the total number of time-slots contained 
in epochs 1 through i is given by i ■ 2* +1 (proof is given in the appendix). As a result, we have: 

Theorem 4. Algorithm [7] guarantees that each node discovers all its neighbors on all channels 
within 0((M + 0)log(M + 0)) time-slots of T s with probability at least 1 — e, where 

yp _ 16max(g,A 0 ) ^ ^ —^ 

4 Neighbor Discovery in Asynchronous System 

We first describe an algorithm that assumes that nodes know a good upper bound on maximum 
node degree. We then describe a more general algorithm in which no such knowledge is assumed. 
The second algorithm, however, requires a tighter bound on the clock drift rate than the first 
algorithm. Both algorithms are derived from their synchronous counterparts in which different 
nodes can start at different times. 
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Figure 1: Execution of a node with respect to its local clock. All frames are of length L. All 
slots are of length ^. 
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Figure 2: Execution of the network with respect to real time. Frames may be of different lengths 
(even within the same node). Slots may be of different lengths (even within the same frame). 

4.1 Knowledge of Good Upper Bound on Maximum Node Degree 

Our neighbor discovery algorithm for an asynchronous system is based on our neighbor discovery 
algorithm for a synchronous system with variable start times (Algorithm [3]). Let A es t denote an 
upper bound on the maximum node degree as known to all nodes. In addition, our algorithm makes 
the following assumption about the maximum drift rate of the clock of any node 5: 

Assumption 1. The maximum drift rate of the clock of any node is bounded by ^ seconds/second. 

The offset or skew between clocks of any two nodes may be arbitrarily large and, in fact, may 
grow with time. We now describe how to extend Algorithm [3] to solve the neighbor discovery 
algorithm in an asynchronous system. 

Each node divides its time into equal-sized frames. Frames of different nodes are not required 
to be synchronized and may, in fact, be misaligned. A node measures the duration of a frame using 
its local clock. The length of a frame as measured by a node using its local clock is same for all 
nodes, say L. Note that, because of the clock drift, duration of a frame, when projected on real 
time, may be different from L —shorter than L in case of positive drift and longer than L in case 
of negative drift. Specifically, it can be verified that the length of a frame in real time lies in the 
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// Algorithm for node u 

Input: An upper bound on maximum node degree, say A est . 

Output: The set of neighbors along with the subset of channels that are common with the neighbor. 


// compute the transmission probability 

, • ( 1 |A(u)| \ 

1 p ■<— min 3 S“ J ; 

2 while true do 

// select a channel 

3 c <— channel selected uniformly at random from .A(ii); 

4 tune the transceiver to c; 

5 switch to transmit mode with probability p and receive mode with probability 1 — p; 

6 if (in transmit mode) then 

r | transmit a message containing _4(u) during each slot of the frame; 

8 else 


9 

10 

n 

12 

13 

14 


// in receive mode 

foreach (clear message received during the frame) do 

let the received message be sent by node v containing set A\ 
add (v, A n A(u)) to the set of neighbors; 

end foreach 
end if 
end while 


Algorithm 5: Neighbor discovery algorithm for an asynchronous system with a bound on clock drift 
rate and knowledge of upper bound on maximum node degree. 


range: 

- < frame length in real-time < -- (9) 

1 + <5 “ “1-5 v ' 

A node further divides each frame into three equal-sized slots—equal with respect to its local clock. 
Therefore the duration of a slot as measured by a node using its local clock is j. At the beginning 
of each frame, a node u randomly selects a channel from its available channel set, say c. It then 
transmits on c during each slot of the frame with probability min and listens on c during 

the entire frame with the remaining probability. In the former case, u transmits the same message 
during each slot of the frame. In the latter case, u listens for any clear messages it may receive 
during any part of the frame. The partition of a frame into slots is not important for a receiving 
node; they are only used by a transmitting node. Also, a node may receive multiple clear messages 
while listening during a single frame. The execution of a node with respect to its local clock is 
shown in Fig. [I] The execution of the network with respect to common real time is shown in Fig. [2j 
A formal description of the algorithm is shown in Algorithm [BJ We now analyze the running 
time of the algorithm. In our analysis, unless otherwise stated, time refers to real-time. Of course, 
nodes do not have access to the real-time. We define two notions we use in our analysis as follows: 

Definition 1 (aligned pair). We say that a pair of frames (f,g) is aligned if at least one slot of 
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f lies completely within g. 


Definition 2 (overlapping frames). For a frame f and a node u, let overlap(f,u) denote the 
subset of frames of u that overlap in real-time with f. Further, let overlap-all(f) denote the subset 
of all frames that overlap in real-time with f. 

For example, as per the execution shown in Fig. [2J pairs (fi,gi) and (f 2 , hi) are aligned whereas 
the pair is not. Also, overlap{g 2 ,v ) = {/i,/ 2 } and overlap-all(g 2 ) = {/i, / 2 , 92 , h 1: h 2 }. 

One of the arguments we commonly use in our analysis is: u x adjacent slots of a node cannot 
strictly contain y adjacent slots of another node” for certain specific values of x and y with x < y. 
To prove this, we argue that, for the statement to be false, the following inequality must hold: 

xL yL 

3(1 - 5) > 3(1 + <5) 

and then show that it contradicts Assumption [TJ The above inequality must hold for the statement 
to be false because the left hand side denotes the largest possible length of the time interval 
containing x adjacent slots, and the right hand side denotes the smallest possible length of the time 
interval containing y adjacent slots. 

Lemma 5. A frame of a node overlaps with at most three frames of any other node. Formally, 

Vf,u :: \overlap(f,u)\ < 3 

Proof. Assume, in order to obtain a contradiction, that a frame of some node, say /, overlaps with 
at least four frames of another node. This implies that there are at least two consecutive frames 
that are strictly contained within /. From ([9|), a frame of a node can strictly contain two frames 
of another node only if the following condition holds: 

L 2L . 1 

--f > --= =>■ <5 > - 


This contradicts Assumption [1] □ 

Consider the link from node v to node u on channel c. Also, consider frames / of v and g of u 
such that the pair (/, g) is aligned. We extend the notion of a link covered by a time-slot (defined 
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in Section 13.1.11) to a link covered by an aligned pair of frames. Specifically, the pair of aligned 
frames (/, g) covers the link (v,u) on channel c if: (i) v transmits on c during /, (ii) u listens on c 
during g , and (iii) no other neighbor of u , say w, transmits on c during any frame in overlap(g, w). 
Also, (/, g) covers [y,u) if (/, g) covers (y,u) on some channel c £ span{v,u). The three conditions 
collectively ensure that u receives a clear message from v (on c during g) provided (f,g) is aligned. 
We have: 


Lemma 6. If (f,g) is aligned, then (f,g) covers (v,u) with probability at least 


_ p _ 

8max(2S,3A es t) ' 


Proof. Analogous to A(r, c), B(t,c) and C(t, c) defined in Section [3T7TJ let A(f,g,c), B(f,g,c), 
C(f, g , c) denote the three events corresponding to the three conditions (0), ([n]) and (lull) , respectively, 
required for coverage. Note that the three events are mutually independent. As before, we compute 
the probability of occurrence of the three events separately. 


Computing the probability of occurrence of A(f, g, c ) We have: 


Pr {A(f,g,c)} = ( v selects c at the beginning of /)A 


(v chooses to transmit during /) 
1 . f 1 |A(u)| 

, , . X mm -, —-- 

|A(u)| \2 3A est 

1 1 


= mm 


> 


2\A(v)\ ’ 3A es t 
1 


max(2S, 3A est ) 


Computing the probability of occurrence of B(f,g,c ) We have: 


( 10 ) 


Pr {B(f,g,c)} = ( u selects c at the beginning of g) A 


{u chooses to listen during g) 
1 \A(u)\ 


1 


X < 1 — min 


> 


\A(u)\ 
min(x, y) < x 
1 


2’ 3A, 


est 


\A(u)\ 


x 1-- = 


2\A(u)\ 


( 11 ) 
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Computing the probability of occurrence of C(f,g,c ) Let J\f(u,c ) denote the set of neigh¬ 
bors of u on c. Note that, if N(u,c) only contains v, then Pr {C(f,g,c)} = 1. Otherwise, we 
have: 


Pr {C(f,g,c)} 


n pr 


w€J\f(u,c) 

w^v 


w does not transmit on c during any frame 
in overlap(g , w) 


n n pr 

w£j\f (u,c) h£overlap(g ,w) 

Wy^V 


w does not transmit on c 
during h 


> 


n n {1 — Pr(w transmits on c during h)} 

i 00)1' 


w£j\f (u,c) h£overlap(g ,w) 
w^v 


> 


n n 

w£j\f (u,c) h£overlap(g,w) 

Wy^V 

n n 

w£Af(u,c) h£overlap(g,w) 
w^v 


min(x, y) < y 


n n 

w£j\f (u,c) h£overlap(g,w) 
w^v 


1 - 


1 - 40)1 


x nun 


1 — min 


2’ 3A est 

1 


2|^H|’3A eS i 


1 - 


3A, 


est 


using Lemma0 \overlap(g,w)\ < 3 


> 


n f 1 3A est 


w£Af(u,c) 
w^v 


( 3A est ) 


\ 3(|A/"(m,c)|—1) 


3A es t 

c) | — 1 = A (u, c) - 1 < A est 


> 1 - 


1 


3A, 


est 


3A e 


{Vx > 2, a-r is a monotonically increasing 
function of x and thus > |} 
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1 

> - 

- 4 


( 12 ) 


Finally, using a derivation similar to that for ([6]), it can be shown that (f,g) covers (v,u) with 
probability at least 8max(2 ^ 3Aest) • □ 

For a frame /, we use node(f) to denote the node to which / belongs. For example, as per the 
execution show in Fig. [2l node(fi) = v and node{h\) = w. We now define a precedence relation 
between pairs of frames referred to as frame-pairs for short. 

Definition 3 (precedence relation). For frame-pairs { f,g ) and ( p,q ), we say that { f,g ) pre¬ 
cedes (p,q), denoted {f,g ) C ( p,q), if (i) node(f) = nodefp ), (ii) node(g) = node(q), (in) the start 
time of f is before that of p, and (iv) the start time of g is before that of q. 

For example, as per the execution shown in Fig. [2l (fi,gi) IZ (f 2 ,g 2 ) and (fi,gi) C (./a, 32 ) but 
(/i)5i) (/1 ■ < 72 )- We next define a notion on a sequence of frame-pairs that intuitively enables us 

to treat coverage provided by different frame-pairs as essentially independent events. 

Definition 4 (admissible sequence). A sequence o of M frame-pairs < (fi k ,gj k ) f is ad- 

l J 1 <k<M 

missible with respect to the link ( v , u ) if it satisfies the following conditions: 

1. \/i : 1 < k < M : node(fi k ) = v and node(gj k ) = u, 

2. Vz : 1 < k < M : ( fi k ,g jk ) C (fi k+1 ,gj k+1 ), 

3. Vz : 1 < k < M : (fi k ,gj k ) is aligned, and 

4■ Vz : 1 < k < M : overlap-all(gi k ) H overlap-all(gi k+1 ) = 0. 

For a sequence a of frame-pairs that is admissible with respect to (v,u), we say that a covers 
(v,u) if some frame-pair in the sequence covers ( v,u ). 

Lemma 7. Let a be a sequence of 8max ( 2 ^ 3Ae ^ f ) j n ^ frame-pairs such that a is admissible with 
respect to ( v,u ). Then the probability that a does not cover (v,u) is at most 

Proof. For convenience, let M = 8max ( 2 ^ 3A e S t) i n ^ . Let with 1 < k < M denote the 

event that ( fi k ,gj k ) covers (v,u) on some channel. Note that the occurrence of the event 
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Figure 3: Slots used in the proof of Lemma [HI 


depends only on frames in overlap-cill(gj k ). Since a is an admissible sequence, for all x and y, with 
1 < x < y < M, overlap-all(gj x ) D overlap-all(gj y ) = 0. In other words, the set of frames that 
overlap with the frame gj x are distinct from the set of frames that intersect the frame g ]y . Since, 
for each frame, a node chooses its action randomly, events E^s are mutually independent of each 
other. Using a derivation similar to that of ([7]), it can be shown that a does not cover (y,u) with 
probability at most -Ay. □ 

We next show that any execution of the network must contain a “sufficiently long” sequence 
of admissible frame-pairs. In the remainder of this section, let T s denote the time by which all 
nodes have initiated the neighbor discovery algorithm, Algorithm [5[ To show the existence of a 
“‘sufficiently long” admissible sequence, we first show that, for any instant of time T after T s and, 
for any pair of neighboring nodes, there is an aligned pair of frames after but “close” to T. 

Lemma 8. Consider a link from node v to node u and some instant of time T with T >T S . Let 

fi (respectively, gi) with i > 1 denote the i th full frame of node v (respectively, node u) after T. 

Then some frame in {/i,/ 2 } is aligned with some frame in {< 71 , 52 }- 

Proof. Note that the frames fi and together contain six slots. Let a* for i = 1, 2,..., 6 denote 

the start times of the six slots numbered in the increasing order of their start times (see Fig. [3]). 
Likewise, let bi for i = 1, 2,..., 6 denote the start times of the six slots of the frames g± and 52 - 


Claim 1: We first show that b\ < 05 . Assume, by the way of contradiction, that 05 < b\. Note 
that, by definition of gi, there is only a partial frame of u between T and 61 . This, in turn, implies 
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Figure 4: Various cases in the proof of Lemma 0 


that a partial frame of u contains at least four slots of v. This can happen only if the following 
condition holds: 


L 4L 

> 


1-6 3(1 + 6) 

This contradicts Assumption [TJ 

Claim 2: Likewise, we can show that <21 < 65 . 


6 > - 
7 


Claim 3: If a*- 1 < b± < a* for some i with 1 < i < 5, then b\ < ai < a,;+i < 64 . It suffices 
to show that a,; + i < 64 . Assume, by the way of contradiction, that 64 < a l+ \. This implies that 
Oj _1 < 61 < 64 < a t +1 . In other words, two adjacent slots of v strictly contain an entire frame of u. 
This can happen only if the following condition holds: 

2L L __ 1 

3(1-5) > 1 + 5 ^ > 5 

This contradicts Assumption [lj 
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We now prove the lemma statement. We consider two cases depending on whether f\ or g\ has 
earlier start time. 

• Case 1 (ai < b\): In this case, we have ai < 64 < 05 (see Fig. 01 (a)). The rightmost inequality 
follows from Claim 1 . The time interval from ai to 05 consists of four contiguous slots of v: 
[01,02], [02,03], [03,04] and [04,05]. Clearly, 64 lies in at least one of them. (If 64 lies on the 
boundary of two slots, we select the earlier one.) In any case, using Claim 3 , we can show 
that at least one of the slots [02,03], [03,04], [04,05] or [05,05] is contained in the frame <72- 
This implies that either (f\. g\) or (/2,<?i) is aligned. 

• Case 2 (b\ < a\): If 02 < 64, then 64 < 01 < 02 < 64 (see Fig. 0 (b)). In other words, the 
first slot of the frame f\ is contained with the frame g \, which implies that (f \, g\) is aligned. 
Therefore assume that 64 < 02. We have two subcases depending on where 04 lies relative to 
64. Let 67 denote the end time of the frame <72 • 

— Case 2.1 (a\ < b 4 ): In this case, we show that the slot [02,03] is contained within the 
frame [64,67] (see Fig. 0 (c)). To that end, we first prove that 03 < 67. If not, then 
01 < 64 < 67 < 03. In other words, two adjacent slots of v strictly contain an entire 
frame of u. This can happen only if the following condition holds: 

2 L L 1 

3(1 -5) > 1 + 6 ^ > 5 

This contradicts Assumption 0 Therefore, we have 64 < 02 < 03 < 67, which implies 
that the pair (/i,^) is aligned. 

— Case 2.2 (b 4 < a\): In this case, we show that the slot [04,02] is contained within 
the frame [64,67] (see Fig. 0 (d)). To that end, we show that 02 < 67. If not, then 
a i < 65 < 67 < 02- (The inequality 04 < 65 follows from Claim 2 .) In other words, one 
slot of v strictly contains at least two adjacent slots of u. This can happen only if the 
following condition holds: 


L 2L 

3(1 - 6 ) > 3(1 + 5) 


6 > 


1 

3 
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This contradicts Assumption |T| Therefore, we have 64 < ai < 02 < 67 , which implies 
that the pair (fi,g 2 ) is aligned. 

In all cases, we show that one of the four pairs {fi,gi), (/i,<? 2 ), {f 2 ,gi) or (/ 2 ,<? 2 ) is aligned.□ 

We now show that existence of a “sufficiently long” admissible sequence in any execution of 
Algorithm [5j 

Lemma 9. Consider a link from node v to node u. Further, consider an execution of the network 
after T s that contains at least M full frames of u as well as v. Then the execution contains a 
sequence of at least ^ frame-pairs such that the sequence is admissible with respect to ( v,u ). 

Proof. The proof is by construction. The construction is in two steps. In the first step, we construct 
a sequence of frame-pairs 7 that is “almost” admissible in the sense that it satisfies the first three 
properties of an admissible sequence but may not satisfy the fourth property. We show that 7 
contains at least 44 frame-pairs. In the second step, using 7 , we construct a sequence of frame- 
pairs a that satisfies all four properties of an admissible sequence. We also show that u contains 
at least ^ frame-pairs. 

Constructing 7 To obtain the first frame-pair (/q,^) that is aligned, we apply Lemma[ 8 ]to T s . 
Now, assume that we have already selected k frame-pairs satisfying the first three properties of an 
admissible sequence. Let the k th frame-pair be denoted by ( fi k ,gi k )• To select the next frame-pair, 
let Tfc be defined as the earlier of the end times of frames fi k and gi k . To obtain the next frame-pair 
that is aligned, we apply Lemma[ 8 ]to Tf.. Let the frame-pair be denoted by {fi k+1 , gi k+1 )■ Clearly, 
the extended sequence (consisting of k + 1 aligned frame-pairs) satisfies the first and third properties 
of an admissible sequence. The second property holds because the start times of both fi k and gi k 
are before Tk, whereas the start times of both fi k+1 and gi k+1 are after T \We repeatedly select 
aligned frame-pairs using Lemma [ 8 ] until we run out of frames of either u or v. We now establish 
a lower bound on the length of 7 . Note that, when selecting the (k + l) st pair, the first full frame 
of node v after T(namely frame f\ in the statement of Lemma [ 8 ]) is adjacent to the frame fi k . 
This follows from the definition of X*.. This, in turn, implies that the frame fi k+1 obtained using 
Lemma[ 8 ]is within a distance of two of the frame fi k . Likewise, the frame gt k+1 is within a distance 
of two of the frame gt k . As a result, 7 contains at least 4f frame-pairs. 


24 


Constructing a To construct a sequence a that also satisfies the fourth property of an admissible 
sequence, we choose every third frame-pair of 7 starting with the first frame-pair (/q, g t] ) • Clearly 
a also satisfies the first three properties of an admissible sequence (since it is a subsequence of 7 ). 
Let the k th frame-pair of a be denoted by ( fj k i9j k )■ To prove that a satisfies the fourth property 
as well, assume, by the way of contradiction, that some frame, say h, overlaps with two consecutive 
frames gj k and gj k+1 for some k. Note that, since we selected only every third frame-pair of 7 
to construct a, there are at least two other frames of u between gj k and gj k+1 ■ This implies that 
h overlaps with at least four frames of u, which contradicts Lemma [5j This establishes that the 
length of a is at least □ 

Finally, we have the main result. 

Theorem 10. Let T s be the time by which all nodes have initiated neighbor discovery. Also, let 
Tf be the earliest time by which each node has executed at least 48max (^ 5 '’ 3Aes *) i n ^ full frames 
since T s . Then Algorithm 0 ensures that each node discovers all its neighbors on all channels with 
probability at least 1 — e by time Tf. 

Proof. Consider a link (v,u). From Lemma 0 the execution from T s to Tf contains a sequence of at 
least 8max ( 2 ^’ 3Aest ) | n frame-pairs such that the sequence is admissible with respect to (v,u). 

From Lemma [TJ the probability that (y,u) is not covered by Tf is at most fffz. This implies that 
the probability that some link in the network is not covered by Tf is at most e. □ 

We now bound the length of the interval Tf — T s . 

Theorem 11. Let T s and Tf be as defined in Theorem 1 1 (A Then the length of the interval Tf — T s 
is upper bounded by j 48max(2ff,3A es t) + 1 j ( 775 ) • 

Proof. By our choice of Tf, there exists some node such that Tf — T s contains exactly 
48max(2S , ,3A esf ) ^ j full frames of that node. The first frame of that node in the execution 
may be partial frame. From Q, the length of each frame is upper bounded by -^7. Combining 
the three facts, we obtain the result. □ 

In our algorithm, we divided our frame into three slots. It is possible to divide a frame into only 
two slots and a result similar to Theorem [TT| can be derived. However, the proof is more involved. 
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// Algorithm for node u 

Output: The set of neighbors along with the subset of channels that are common with the neighbor. 


1 

2 

3 


4 


5 


6 

7 

8 
9 

10 

11 


i <-1; 

while true do 
// epoch i 
for j 4— 1 to i + 1 do 

// compute the transmission probability 

for k -4— 1 to 2* do 

// phase j in epoch i 
// select a channel 

c <— channel selected uniformly at random from A(u); 
tune the transceiver to c; 

switch to transmit mode with probability p and receive mode with probability 1 — p\ 
if (in transmit mode) then 

| transmit a message containing A(u) during each slot of the frame; 
else 


// in receive mode 

12 foreach (clear message received during the frame) do 

13 let the received message be sent by node v containing set A\ 

14 add (v, A n A(ti)) to the set of neighbors; 

is end foreach 

16 end if 

17 end for 

is end for 

19 i <— i + 1; 

20 end while 


Algorithm 6: Neighbor discovery algorithm for an asynchronous system with a bound on clock drift 
rate. 


Likewise, a frame can be divided into four or more slots albeit at the expense of increasing the 
running time of the algorithm. 


4.2 No Knowledge of Maximum Node Degree 

The algorithm for this case is similar to the one for a synchronous system, namely Algorithm [U but 
with time-slots replaced with frames and transmission probability calculated as in Algorithm [5l A 
formal description of the algorithm is given in Algorithm 0 

The time-complexity analysis of Algorithm [6] is more involved because of asynchrony. Unlike in 
a synchronous system, the time difference between when two nodes start the same phase may grow 
with time due to clock drift. We need some additional notation for the analysis. 

Let A (i,j) denote the phase j of the epoch i. Let F t ^ denote the number of frames that a node 
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in the execution until (but not including) AWe have: 


Fi j = (number of frames contained in the first i — 1 epochs) + 

(number of frames contained in the epoch until A (i,j)) 

{using (|A.6|) in the appendix and the fact that, for each j, |A(«, j)| = 2*} 

= (» — !)■ 2* + (j - 1) ■ 2* 

= (* + j-2)-2 < (13) 

Let T b “(i, j) denote the time at which node u begins executing A (i,j). Likewise, let T™(i,j) 
denote the time at which node u ends executing A(i, j). Let T™ ax (L j) denote the latest time 
at which some node starts executing ALikewise, let T“ lin (i,j) denote the earliest time at 
which some node starts executing A (i,j). Symbols T s , 0 and L have the same meaning as before. 
Note that, in real-time, the minimum and maximum lengths of a frame are given by j-fg and 
respectively. Thus, we have: 


T b max (i, j) 

= max{T b “(i, j)} 

VI 

+ F 4i-s, 

) 

(14) 

r e min (i,j) 

= min{T“('i,j)} 

Al 

- 0) + (Fij + 2 l ) ^ 

A) 

(15) 


Suppose i and j are such that T™ in (i,j) < T b max (i, j). Note that, during the time-period from 
T™ ax (i, j) to T f miI1 (z, j), all nodes will use the same estimate for maximum node degree. Recall that 
Ao denotes the smallest power of two greater than or equal to 5. Let M = 18 md.x(2S,3A 0 ) j n +1. 
From Theorem 1101 if the time period from TF iax (i. j) to T 0 rnm (z, j) is such that: (a) the time period 
contains at least M full frames and (b) each node uses an estimate value of Ao during the time 
period, then the neighbor discovery is guaranteed to complete with probability at least 1 — e. Let 
io = [log (6 M + ^y)~| and jo = |" logA](= log Ao). Note that, by definition, during A(*o, jo), each 
node uses Ao as the estimate for maximum node degree. It suffices to prove that the time period 
from T s iojo to Tfi^jo contains at least M frames. To that end, we make the following assumption 
about the clock drift rate: 

Assumption 2. Let D = 48 {log (^) + log (l + ®) + 5} . The maximum drift rate of the clock 
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of any node is bounded by seconds/second. 

We first prove the following lemma: 

Lemma 12. If Assumption [H holds, then iq ■ 5 < . 

Proof. We have: 


M = 


< 


48 max (25,3Ao) 

-In 


( 48- (25) • (6N)\ 

l i ) 

576 • 5 2 • N 3 1 

-+ 1 


e 



This implies that: 


log M < log 576 + 3 log 


NS 


+ 1 


< 3 log 


”) 


+ 11 


Now, we have: 


iq ■ S < {log ( 6 M + J + 1 } ■ 6 

< <J log 6 + log ( M + ® + 1 ) + 1 } ■ 6 

{for all x , y with x > 1 and y > 1, log (a: + y) < log(xy) < log x + log y} 
0 


< |log(M) + log ^1 + — ) + 4 { • 6 
{using (fl6D | 

< 13 log + 11 + log ^1 + ® ) + 4 J> • 5 

< 3 • | log + log ^1 + -+ ) + 5 )■ • 5 

{using definition of D} 

<3 . — 

- 48 


{using Assumption [2]} 


(16) 
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< 3 - 


D 1 


1 


48 D 16 


This establishes the lemma. 


□ 


Now, substituting i = io and j = jo in (fT4l) and (fT5l) and subtracting (fTTl) from (fl5l) . we obtain: 


~ TT“(io, jo) 

>(r. 0J0 + 2»)( T h I )-r, 0J0 ( T h_|- e 


{rearranging terms} 

=(tTj) - 1 - 9 

{using {13])} 

= 2*° ( - (i 0 + j 0 - 2) • 2 i0 (1 - 0 


> 2 *° 


l + <5 
L 


1 — (*o + jo ~ 2) 


1 - 5 2 
25 


1+5 )| v ' V 1 - d 

{*o + jo — 2 < 2io} 


- 0 


5 < 


1 


>2*o (^L\ (1 — 8 • z 0 • d) — 0 

{using Lemma IT2l} 

> 2-.(|)-0 

2 |6M + f) (I)- 6 


= 2 ML + 0 - 0 = 2 ML 


We now prove the main result. Since the maximum length of a frame in real-time is and 

5 < it follows that the time-period from T™ ax (io, jo) to T b max (io, jo) contains at least = 
2M(1 — 5) > M full frames. 

The above analysis implies that, once all nodes have completed their epoch io, the neighbor 
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Table 1: A summary of all randomized neighbor discovery algorithms in this work. 


Assumptions 

Time Complexity* 

Section 

Synchrony 

Start Times 

System Parameters 

Synchronous 

Same 

know an estimate for an upper 
bound on A, given by A es t 

0 ( max(S,A) log(Aest)log (^)) 

I3.1.H 

Synchronous 

Same 

- 

O ( M log M) where 

M — I 6 max ( s > A ) j n ^ n2 ^ 

13X21 

Synchronous 

Different 

know an estimate for an upper 
bound on A, given by A es t 

q ^ max(2S',A es t) ^ N ^ ^ 

13.2.11 

Synchronous 

Different 

- 

0((M + ©) log(M + 0)) where 
— 16max(S,A 0 ) J n ( JV 2 'j 

[3X2] 

Asynchronous 

Different 

know an estimate for an upper 
bound on A, given by A es t, and 

<5<7 

q ^ L max(25’,3A es t) 

rrn 

Asynchronous 

Different 

5 ~ 48{log(^)+log(l+f )+5} 

O ({ML + 0) log (m + § j) where 

M _ 48max(2S,3A 0 ) l n ^N 2 ^j _|_ j 

roi 


*: In the case of variable start times, the time is measured after the last node has started neighbor discovery 
N : total number of CR nodes in the network 

A: maximum degree of a node on a channel Ao: smallest power of two greater than or equal to A 

S : size of the largest available channel set p : minimum span ratio of a link 

e: maximum error probability (user-specified) 

©: maximum difference in the start times of two nodes 

L : length of a frame 5: maximum drift rate of a local clock 


discovery completes with probability at least 1 — e. Thus we have: 

Theorem 13. Let M = 48mig ^ S ’ 3A o) l n + 1. Then Algorithm [6] ensures that each node 

discovers all its neighbors on all channels within O [{M + ®) log (M + ®) time-units of 

T s with probability at least 1 — e. 

Remark (Impact of Assumption [2D • Even for very large values of system parameters, namely 
N = 10 6 ; S = 10 6 , e = 10” 9 and ® = 10 6 , Assumption d only requires 5 < 2.2 x 10~3 sec¬ 
onds/seconds. This is not very restrictive because, as we mentioned earlier, an ordinary quartz 
clock has a maximum drift rate of 10 -6 seconds/second. 

5 Summary 

In this paper, we have presented several randomized algorithms for neighbor discovery in an M 2 HeW 
network both for a synchronous and an asynchronous system, and also analyzed their running time. 
Table [T] summarizes the results of the previous two sections. We now discuss two extensions to our 
algorithms to enhance their applicability. 
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6 Improving Robustness 


6.1 Handling Diverse Propagation Characteristics 

All algorithms described so far have assumed “one-for-all” property that if a link from node v to node 
u exists on some channel then it also exists on all channels that are present in available channel 
sets of both u and v. That may not hold if different channels have very different propagation 
characteristics. In case the “one-for-all” property does not hold, we can partition the channels 
into bands such that the subset of channels within the same band have very similar propagation 
characteristics. As a result, “one-for-all” holds for each band individually. We say that a link can 
operate in a band if it can operate on some channel in the band. Each link can then be replaced with 
multiple links, one for each band it can operate in. Link for each band then has to be discovered 
separately. This modification increases the running time of our algorithms since more links have to 
be discovered now. The minimum span-ratio in the network now ranges from ^ to instead of ^ 
to 1, where W is the size of the smallest band (in terms of number of channels). Further the term 
“In (“t) ” i n time complexity expressions is replaced with the term “In where B denote 

the maximum number of bands in which any link can operate. 


6.2 Handling Lossy Channels 


All algorithms described so far have assumed that channels are reliable in the sense that a node 
fails to receive a message from its neighbor only due to collision. However, in the real world, a 
node may fail to receive a message due to other reasons as well such as Gaussian noise or transient 
interference. Let with 0 < (f> < 1 denote the probability that a node fails to receive a message 
from its neighbor even in the absence of collision. Nodes may not know 1 j>. With lossy channels, 
the probability that a link is covered by a time-slot (in Algorithm |TJ Algorithm [2], Algorithm [3] and 
Algorithm [4]) or an aligned frame-pair (in Algorithm [5] and Algorithm [ 6 |) is now multiplied by a 
factor of 1 — (f>. For example, the probability expression in Lemma [ 6 ] now becomes §^[ 95 ^ t y 
It can be verified that this has the effect of increasing the running time of an algorithm by a factor 

of ih- 
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6.3 Handling Jamming Attacks 

Most of the work on jamming attack [18H23lf3Hll59] is concerned with analyzing or mitigating the 
impact of jamming attack on communication throughput in the system, and, as such, they assume 
that neighbor discovery has already been done. Work in [6] presents a suite of neighbor discovery 
algorithms in the presence of jamming attack. However, unlike our work, the work in [6] assumes a 
single-hop network and does not provide any theoretical guarantees (algorithms are only evaluated 
using simulation experiments). 

We show that, under certain conditions, our neighbor discovery algorithms, with minor mod¬ 
ifications, are tolerant to jamming attacks in the sense that a malicious jammer who seeks to 
prevent neighbor discovery can only delay time to the completion of neighbor discovery by at most 
a constant factor. 

In formulating the jamming model, we seek a balance between a jammer who is too powerful 
(making the neighbor discovery problem impossible to solve) and one who is too weak (making the 
attack easy to foil). For instance, if the jammer can simultaneously jam all channels at all times, 
then neighbor discovery would be unable to complete. Likewise, if for some pair of neighboring 
nodes, \span{u,v)\ = 1, then the jammer can ensure that discovery does not complete by simply 
jamming this channel. 

We assume an “intelligent” jammer who can scan the spectrum and learn which channels are 
currently under use by transmitting nodes running the neighbor discovery algorithm. The jammer 
can then decide to jam any single channel, thereby disrupting possibly all transmissions on that 
channel. When scanning, the jammer only learns which channels are being used; we assume that it 
cannot determine which channels are used by a majority of nodes by analyzing the relative strength 
of the transmissions. 

In a heterogeneous system, it may be possible for a jammer to increase its effectiveness by 
keeping statistics on which channels are seen in scans more often. In some cases, this would 
correspond to channels used by more nodes. It is not clear how best to analyze this case, so we 
explicitly avoid it by directing our analysis towards homogeneous systems in which \span(u, w)| = S 
for all neighboring nodes u and v. 

The jammer can change the channel which it jams during the course of the algorithm. Clearly, 
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if the jammer is able to switch channels instantaneously, it would be possible for the jammer to 
block every channel in every slot making it as powerful as the one who can simultaneously jam 
all channels at all times. To avoid this, we assume a lower bound on the time between switching 
channels for jamming. In the asynchronous case, this lower bound also allows to assume that the 
outcomes of two consecutive scans are independent of each other. 

Note that the jammer need not respect slot boundaries even when the system is synchronous. 
Therefore the jammer can effectively jam two channels in a given slot by selecting a new channel 
to jam near the middle of a slot. 

We also assume that the jammer has complete knowledge of our algorithm, but is ignorant of 
the exact random choices which are made. Clearly, the jammer’s best option is to make a random 
selection of channels to jam. However, since the system is homogeneous, the best that the jammer 
can do is make a selection at random, in an attempt to foil the random choices made by our 
algorithm. 

Based on the discussion above, we now formulate our jamming model. The jammer operates in 
rounds. At the beginning of each round, the jammer scans the spectrum and selects a channel to 
jam during that round from among the channels it observed to be in use in the scan. Specifically, 
let j denote the channel jammed by the jammer in the previous round. Also, let C denote the set 
of channels the jammer found to be in use in the scan it performed at the beginning of the current 
round. The jammer then makes a uniform random selection of a channel from the set C \ j, if 
non-empty, and jams this channel during the current round. We assume that round length is at 
least at large as slot length, but slot boundaries and round boundaries need not be aligned even if 
the system is synchronous. (A jamming round can start in the middle of a slot.) Note that it is not 
necessary for the jammer to jam j again since it would have already disrupted all transmissions on 
channel j in the current slot. 

Note that, in the above-described jamming model, the neighbor discovery problem can only be 
solved if there are at least three nodes and at least three channels. We now consider synchronous 
and asynchronous cases separately. 
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6.3.1 Synchronous Algorithms 

Recall the events B(t,c) and C(t,c ) defined earlier when the analyzing the probability 

that node u discovers node v on channel c during time-slot r. We define a new event D(t,c) to 
represent the condition that the transmission from node v to node u on channel c is not jammed 
during the time-slot r. For convenience, let ABC(t,c) denote the conjunction of the three events 
A(t,c ), B(t,c ) and C(t,c). We derive a lower bound on the conditional probability that event 
D(t,c ) occurs given that event ABC(t,c ) occurs. Note that the slow-down factor in the running 
time of a neighbor discovery algorithm that is undergoing a jamming attack is given by the inverse 
of this conditional probability. 

Assume that, at the beginning of the time-slot r, the jammer was jamming channel j. Sometime 
during r, the jammer switches to a new channel to jam. We have: 

Pr {D(t, c ) | ABC(t, c)} = ■ ME(t, c ) | ABC(t, c)} • Q) (17) 

where E(r. c) denotes the event that some node in the system transmits on a channel other than 
c and j during the time-slot r with c j. In the above expression, the first term (given by ^jp~) 
captures the probability that c A j and the third term (given by the probability that the jammer 
chooses a channel other than c to jam. 

We next compute the conditional probability that the event E(r, c ) occurs given that the event 
ABC(t , c) occurs. We partition the set of nodes in the system into three mutually-exclusive groups: 
G \, G 2 and G 3 . Group G*i contains nodes u and v, group G 2 contains nodes that are neighbors of 
node u except for node v, and group G 3 contains the remaining nodes. 

Let A denote the set of all available channels. (Note that, by our assumption, all nodes have 
identical available channel sets.) Also, let F(t, c, w) denote the event that node w does not transmit 
on a channel in A\{c, j} during time-slot r. Since nodes act independently of each other, we have: 

Pr {E{t,c) I ABC{t,c)} = 1 - n n Pr{F(r, c, w) \ ABC(t,c)} 

ie{l,2,3} weGi 

: w € G\ : Pr {F(t,c,w) \ ABC(t,c)} = 1 
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i- n n Pt{F{t,c,w) | ABC(t,c)} 

iE{2,3} w£Gi 

Vrc : w G G 3 : Pr {F(t,c,w) \ ABC(t,c)} = Pic{F(t,c,w)} 

V nodes act independently 

1 — ( n Pr { F (A c, w) | ABC(t, c)} I • j Pt{F(t,c,w)} ) (18) 

\w£G2 ) \w£G3 


Let p denote the placeholder for transmission probability. Note that p = min For 

our analysis, we focus on time-slots for which A es t satisfies the following condition: 


A est < 2 (N - 2) 


(19) 


We show that the above-condition trivially holds for three out of four synchronous algorithms, 
namely Algorithm [TJ Algorithm [2] and Algorithm 0J For Algorithm [3J in which A est is an “external” 
parameter, we state it as an explicit assumption. In the three synchronous algorithms (in which 
nodes estimate degree), for the time-slots used to show that neighbor discovery completes with 
arbitrarily high probability, note that A es t is the smallest power of two that is greater than or 
equal to either A (u,c) or A. In either case, we have: 


A, 


est 


2 

A es t 


< A 
< A - 1 


A < N- 1 

A est. 


< (N — 1) — 1 = AT — 2 


We now derive the conditional probability Pr {F(t, c,w) \ ABC(t, c)}. There are two cases 
depending on whether w G G 2 or w G G3. First, assume that w € G2. We have: 


Pr {F(t,c,w) I ABC(t,c)} = Pr{u; does not transmit on A \ { c,j } | w does not transmit on c} 

Pr{X \Y} = Pr{ V A r } 


Pr{y} 

Pr{(u; does not transmit on A \ { c,j }) A (w does not transmit on c)} 
Pr{tc does not transmit on c} 
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Pi'{(u> listens) or (w transmits on channel j)} 

Pr{(rc listens) or (w transmits on a channel other than c)} 
l-p + p-l 


l-p + p- ^ 

l - p 

1 s 


{ 5 

< multiplying numerator and denominator by — 

l P 

f-S + 1 


= 1 - 


- 1 
5-2 

- 1 


( 20 ) 


Next, assume that w E G 3 . We have: 


Pr{F(r, c, w) | ABC(t, c)} = Pr{tc does not transmit on A \ {c, j}} 

= Pr{(rc listens) or {w transmits on channel c or j)} 


= l-p + p- - 


= 1 -P- 


5-2 

5 


= 1 - 


5-2 


s 

V 


( 21 ) 


Using (fl 8 l) . (1201) and (12TT) . we have: 


Pr{^(r, c) | ABC(t, c)} = 1 - fl-|— II f 1 ^ 

W&G 2 V P / w£G3 V 


s 

p 


si- n i-t n 

weG2 \ p / « jgg 3 \ p / 


1- n 1- 

W(zG2UG3 \ 

5-2 


=1- 1-P 


5-2 

1 

p / 
N—2 


( 22 ) 


We analyze the cases corresponding to when p = \ and when p = -£■— separately. 
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Case 1 (p = |): Using (|I7|) and (f 22)1 . we have: 


Pr{D(r, c) | ABC(r,c)} > i • • 11 - (l - } 

jiV > 31 





> 1 2 1 _ 1 
“ 4 ' 3 ' 3 ~~ 18 


• Case 2 (p = Using ([22]) . we have: 


Pr{U(r,c) | ifiC(r,c)} = 1 - 1 - 


5-2 


A, 


est 


N—2 


substitute x = 


v=^- 
” A es t 


< 


1 


> 1 - 1 - 


X 


A es i 2 
N—2 


using (USD, N — 2 > 


A. 


est 


>i-< l- 


r . S-2 

1 \ 1 2 

X 


1 \ 1 

1 -< — when x > 2 

x) e 


> 1 - 


S-2 

1 \~ 


> 2 


N-2> 


x>2 


x(S — 2 ) 


(23) 


Using (fI71) and (1231) . we have: 


Pr{L>(r, c) | ABC(t, c)} > 


1 /5 — 1' 


5 


1 - 


S-2 ■ 

1 \ ~ 


> - • 1-rr U l--= 


5 


V5 
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1 2 1 _ 1 
2 ' 3 ' 3 “ 9 


Let a(N, 5) denote the slow-down factor experienced by a neighbor discovery algorithm that 
is undergoing a jamming attack. The above analysis implies that cr(N, 5) < 18 for all N > 3 and 
5 > 3. For sufficiently large values of N and 5, we show that the slow-down factor is much smaller. 
Again, we analyze the cases corresponding to when p = \ and when p = separately. 

• Case 1 (p = l): Using (fI71) and (l22j) we have: 


Pr {D{t,c) | ABC(t,c)} > ^ - 


1 - 1 - 


5-2 

25 


N—2' 


_ rr . . , , 5 — 1 , ,5-2 

tor sufficiently large 5, ——— ~ 1 and 


5 


25 


‘ ' i 4 


TV—2 


for sufficiently large N, 


1 , ,1 

2 (1“0) = - 


N—2 


Case 2 (p = ^-y): Using (flTll and (l23l) . we have: 


Pr{D(r,c) | ABC(t,c)} > ^ 


S -2 * 


for sufficiently large 5, 


1 , , 1 
- * 1 ■ <i—°) = - 


5-1 

5 


1 and 


S—2 

1 \~ 


The above analysis implies that: 


Finally, we have: 


lim a(N, 5) < 2 

TV—>-oo 
5"—^oo 
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Synchronous Algorithms 


Asynchronous Algorithms 




Figure 5: Upper bound on slow-down factor as a function of N and S. 

Theorem 14. The running time of the synchronous algorithms increases by a factor of 18 in the 
worst case. Furthermore, when N and S are sufficient large, the running time only increases by a 
factor of at most 2 in the worst case. 

Note that the slow-down factor of 18 represents the worst-case and only occurs for a “small” 
system which is easier to jam. As N and S, increases, the slow-down factor decreases rapidly as 
depicted in Fig. [5j 

6.3.2 Asynchronous Algorithms 

In the asynchronous case, we require a minor change to Algorithm [5] and Algorithm [ 6 ]. Namely, 
when a node is in transmit mode, it makes a uniform selection of channel at the beginning of each 
slot, rather than at the beginning of each frame. We also assume that the length of a jamming 
round is at least the maximum length of a slot (measured in real-time) (which is given by 3 ^^ )■ 
Consider the link from node v to node u. Consider a legal pair of frames (/, g) with node(f) = v 
and node(g) = u. Let slot(f , g) denote the slot of / that lies completely within g. (If there are 
multiple such slots, then choose one arbitrarily.) For analyzing the jamming attack, we narrow the 
period during which no transmission by other neighbors of u interferes with u’s transmission during 
slot(f,g). This, in conjunction with the assumption made on the minimum length of jamming 
round, ensures that the set of channels a jammer observes to be in use in two different scans do not 
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have any correlation. Intuitively, this is because two consecutive scans by a jammer will involve 
two different slots of any node and, in transmit mode, a node now chooses the channel on which to 
transmit at the beginning of a slot (rather than a frame). 

Consider events A(f,g,c), B(f,g,c) and C(f,g,c). To analyze our algorithms under jamming 
attack, we replace the event A(f,g,c ) with a new event A'(f,g,c ) which denotes the condition 
that u transmits on c during slot(f,g). Further, we replace the event C(f,g,c ) with a new event 
C'(f,g, c) which denotes the condition that no neighbor of u besides v transmits on c during any of 
its slots that overlaps with slot(f,g). Clearly, in the absence of jamming attack, u will discover v 
during frame g if events A'(f,g,c), B(f,g,c) and C'(f,g,c) occur. Analogous to Lemma [5j it can 
be easily shown that a single slot of a node can overlap with at most three slots of another node. 
Note that these three slots can come from at most two different frames. It can be verified that ((1011 
holds for A'(f,g,c) and (fl2l) holds for C'(f, 9 ,c) (for the latter, see Section lA.71 in the appendix). 

Analogous to events ABC(t, c), D(t,c ) and E(t,c) and F(t,c,w ) define in the synchronous 
case, we can define events ABC(f,g,c), D(f,g,c), E(f,g,c ) and F(f,g,c,), respectively. Most 
of the analysis for the synchronous case is also applicable to the asynchronous case. Specifically, 
when p = b, the analysis and results for the synchronous case also apply to the asynchronous case. 
However, when p = - , the results for the asynchronous case are slightly different from that for 

the synchronous case. Since the analysis is very similar to that for the synchronous case, we only 
present the main steps. Using an expression similar to (122(1 . when p = - . we can show the 

following: 


Pr{£(/, g, c) | ABC{f, g, c)} = 1 - 1- 


S-2 


3A 


est 


N—2 


> i - 


For arbitrary N and S with N, S > 3, we have: 


Pr {D(f,g,c) | ABC(f,g,c)} > 


1 fS -1 


5 


> 2' V 1 5 


1 2 1 _ 1 
2 ’ 3 ’ 7 ~ 21 


1 - 


1 - 


1 

e 

1 


S—2 ■ 
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For sufficiently large N and S, we have: 


Pr g, c) | ABC(f, g , c)} > ^ • (— 5 “) ' | 1 - Q) j 

.1 , x 1 

-r 1 -( 1 -°) = 2 

The above analysis implies that: 

VIV, VS : N > 3, S >3 : a{N,S) < 21 and lim a(N, S) < 2 

iV->oo 
S'—>-oo 

Finally, we have the following result: 

Theorem 15. T/ie running time of the asynchronous algorithms increases by a factor of 21 in the 
worst case. Furthermore, when N and S are sufficient large, the running time only increases by a 
factor of at most 2 in the worst case. 

Again, as in the synchronous case, the slow-down factor of 21 represents the worst-case and 
only occurs for a “small” system which is easier to jam. As N and S, increases, the slow-down 
factor decreases rapidly as depicted in Fig. [5) 

6.3.3 Discussion 

It is possible for a jammer to jam a band of channels and not just a single channel. In that case, 
we require the the available channel set to contain at least three orthogonal bands of channels. The 
results derived above still hold if the analysis is conducted in terms of bands rather than channels 
and all bands are of the same size. 

It is also possible to analyze the running time of our algorithms when a jammer can jam up to 
k channels in a jamming round for a fixed k, where 1 < k < §. The analysis is more involved and 
requires stronger assumptions on system size. The slow-down factor also increases to 0(k ) and is 
no longer a constant. 
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7 Related Work 


A large number of neighbor discovery algorithms have been proposed in the literature. To our 
knowledge, most of the algorithms suffer from one or more of the following limitations: 

• Single-Channel Network: Several neighbor discovery algorithms have been proposed for a 
(traditional) single channel wireless network ( e.g ., [3lf^l[9l [2^1 [25l[3T1 H2l[5TL[56l - f58l[6T1 [6^1 [6T1 
[69j). Some of these neighbor discovery algorithms, such as those proposed in [56,[58], can 
be extended to work for a multi-channel network (including a heterogeneous network). The 
main idea is as follows. Let the collective set of all channels over which radio nodes in the 
network are capable of operating be referred to as the universal channel set. The main 
idea is to execute a separate instance of single-channel neighbor discovery algorithm on all 
channels in the universal channel set concurrently by interleaving their executions. A node 
only participates in instances that are associated with channels in its available channel set. 
However, this simple approach has several disadvantages. First, it requires that all nodes have 
to agree on the composition of the universal channel set. Second, the time complexity of the 
algorithm for multi-channel network (obtained as above) will always be linear in the size of 
the universal channel set. This is true even if the available channel set of all nodes contain a 
single common channel (an extreme case). In many cases, the available channel sets of nodes 
may be much smaller than the universal channel set. Third, all nodes should start executing 
the algorithm at the same time. Otherwise, different nodes may tune to different channels in 
the same time slot, thereby causing the multi-channel neighbor discovery algorithm to fail. 
As such, this transformation does not work for an asynchronous system. In comparison, in 
our algorithms, nodes do not need to agree on a universal channel set and the time complexity 
of our algorithms depends on the “degree of heterogeneity” in the network. 

• Single-Hop Network: Neighbor discovery algorithms in mm consider a single-hop network 
in which all nodes can communicate with each other directly. Our algorithms, on the other 
hand, work even for a multi-hop network. 

• Synchronous System: Neighbor discovery algorithms in [1711331155] assume a time-slotted syn¬ 
chronous system. Work in [3511441166] assume a synchronous system in which all nodes initiate 
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neighbor discovery at the same time. Moreover, the proposed algorithms are deterministic 
in nature, and have high time complexity that depends on the product of network size and 
universal channel set size. In contrast, our focus in this work is on developing algorithms 
for an asynchronous system that have lower running time (albeit provide only probabilistic 
guarantees). 

• Homogeneous Channel Availability: Neighbor discovery algorithms in [21 1171154] assume ho¬ 
mogeneous channel availability (all channels are available to all nodes). Our algorithms, on 
the other hand, work even when different nodes have different subsets of channels available 
for communication, and, further, their running time adapts to the “degree of heterogeneity” 
in the network. 

• Little or No Theoretical Guarantees: Work in f4Eli.6Q] only evaluate the proposed neighbor 
discovery algorithms using experiments; no attempt is made to provide any theoretical (de¬ 
terministic or probabilistic) guarantees. Work in |6] provides a suite of neighbor discovery 
algorithms that can tolerate jamming attacks. However, it does not provide any theoretical 
guarantees either (algorithms are evaluated using only simulation experiments). In our work, 
in contrast, we analyze all our algorithms theoretically and establish probabilistic bounds on 
their performance. 

Besides the related work mentioned above, Raniwala and Chiueh propose a neighbor discovery 
algorithm for a multi-channel wireless network in [49] but their work assumes that each node has 
multiple interfaces. Law et al. describe a neighbor discovery algorithm for constructing a scatternet 
in [36]. Salondis et al. describe an algorithm for two-node link formation in [52], which is then used 
as a building block for forming a scatternet in [53] . A scatternet is an ad hoc network consisting 
of two or more piconets; a piconet is an ad hoc network consisting two or more (up to eight) 
Bluetooth-enabled devices with one device acting as master and remaining devices acting as slaves. 
Work in [HlITTlilHlEnillHlEniSIlllS] addresses the closely-related problem of rendezvous between two 
CR nodes. Intuitively, the rendezvous problem between two nodes involves ensuring that the two 
nodes tune to the same channel at the same time, which is necessary for nodes to discover each 
other. However, as opposed to the problem considered in this work, the rendezvous problem does 
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not consider the effect of possible simultaneous transmissions by other neighboring nodes at the 
same, which may cause collisions thereby preventing a link from being discovered. 

8 Conclusion 

In this work, we have proposed several randomized neighbor discovery algorithms for an M 2 HeW 
network both for synchronous and asynchronous systems under a variety of assumptions. Our 
algorithms guarantee success with arbitrarily high probability. All our algorithms are designed to 
minimize the amount of knowledge that nodes need to possess a priori before they start executing 
the neighbor discovery algorithm. We have shown that our neighbor discovery algorithms are robust 
to unreliable channels are adversarial attacks. 

As a future, we plan to develop randomized algorithms for solving other communication prob¬ 
lems for an M 2 HeW network such as broadcasting and convergecasting that guarantee success with 
arbitrarily high probability. 
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A Omitted Proofs 

A.l Computing the probability of occurrence of A(r, c) 
We have: 


Pr{A(r, c)} = (v selects c at the beginning of r) A (v chooses to transmit during r) 
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| using ([2D, 2 k 1 < A (u,c) which implies 2 k < 2A (u,c) 
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A.2 Computing the probability of occurrence of B(t,c ) 

We have: 


Pr{5(r, c)} = (u selects c at the beginning of t) A (u chooses to listen during r) 
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A.3 Computing the probability of occurrence of C(r,c) 

Let J\f(u, c ) denote the set of neighbors of u on c. Note that, if A f(u, c) only contains v, then 
Pr{C(r, c)} = 1. Otherwise, we have: 


Pr{C(r,c)} = n Pr(rc does not transmit on c during 
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| c)\ — 1 = A(it, c) — 1 and, using ([ 2 ]), A(it, c) — 1 < 2 k 



{Vx > 2,(1 — i ) x is a monotonically increasing function of x and thus > 5 } 
1 

> - 

“ 4 


A.4 Computing the probability that s covers (v,u) 

Let F(t, c) denote the event that r covers ( v,u ) on c. Note that F{t, c) = A(t, c) AB(t, c) AC(r, c). 
Since A(r,c), B(t,c ) and C(r,c ) are mutually independent events, Pr{F(r, c)} = Pr{A(r, c)} x 
Pr{L?(r, c)} x Pr{C(r, c)}. We have: 


Now, we have: 
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A. 5 Computing the probability of successful discovery of a link in 
stages 

Let M = 16 ma ^A) In . We have: 


16 max(5',A) ( V 2 


Pr((u,u) is not covered within M stages) 

= Pr((u, u) is not covered in the i th stage) 
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A.6 Computing the sum of the number of time-slots/frames in the first k epochs 

Let 5 denote the sum Xi<j<fc(* + 1) ’ 2*. We have: 


5 = 2 • 2 1 + 3 • 2 2 + 4 • 2 3 + ••• + (A; + 1)-2 a 


25 = 


2 • 2 2 + 3 • 2 3 + • • • + k ■ 2 k + (k + 1) • 2 k+l 


Subtracting the first equation from the second and rearranging terms, we obtain: 


S = (k + 1)- 2 k+1 - (2 k + 2 k ~ l + • • • + 2 2 ) - 2 • 2 1 
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A.7 Computing the probability of occurrence of c) 

We focus on the case when exactly three slots of w overlap with slot(f,g ). But the result also 
holds if fewer than three slots of w overlap with slot(f,g). There are two sub-cases: the three 
(overlapping) slots belong to the same frame or two different frames. 


Sub-case 1 (three slots belong to the same frame): In this case, we have: 


Pr{u; does not transmit on c during slot(f, g)}} = 1 — p + p ■ 
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Sub-case 2 (the three slots come from two different frames): In this case, we have: 


Pr{u; does not transmit on c during slot(f,g)} = < 1 — p + p 
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It can now be shown, in a similar manner as in the proof of LemmaEl that Pr{C"(/, g, c)} > 
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